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Abstract 

In this manuscript, we modulate the binding properties of estrogen receptor protein by rationally modifying the amino acid 
composition of its ligand binding domain. By combining sequence alignment and structural analysis of known estrogen 
receptor-ligand complexes with computational analysis, we were able to predict estrogen receptor mutants with altered 
binding properties. These predictions were experimentally confirmed by producing single point variants with up to an order 
of magnitude increased binding affinity towards some estrogen disrupting chemicals and reaching an half maximal 
inhibitory concentration (IC 50 ) value of 2 nM for the 1 7a-ethinylestradiol ligand. Due to increased affinity and stability, 
utilizing such mutated estrogen receptor instead of the wild type as bio-recognition element would be beneficial in an 
assay or biosensor. 
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Introduction 

The estrogen receptor protein (ER) is a member of the 
superfamily of nuclear receptors [1-3] whose natural ligand is 
the hormone 17P-estradiol. Estrogen receptors are present in all 
vertebrates, highlighting the importance of the ER signal pathway. 
Binding of 1 7 P-estradiol to ER activates a signaling pathway that 
regulates several key biological processes such as reproduction, 
embryonic development [4] and homeostasis [5-7] . There are two 
distinct estrogen receptor genes, resulting in two subtypes of 
estrogen receptors (ER^ and ERp) that differ in tissue distribution 
and ligand preference [8]. In addition to the classical ER ligand 
inducible transcription activity, there are mounting evidence that 
ER can act as extra-nuclear activator, independent of gene 
expression and protein synthesis [9] . These activities are linked to 
the ERs residing in, or near, the plasma membrane and seem to be 
involved in breast cancer development and progression [10]. 

ER is composed by three structural domains: a modulating 
domain with ligand-independent transactivation function, a DNA- 
binding domain (DBD) and a ligand binding domain (LBD) [11- 
1 3] . The amino acid sequences of the LBD of estrogen receptors 
from several species are available and they indicate that the core of 
this domain is highly conserved from mammals to fish [14,15]. 



Even if sequence homology in the ligand binding domain is 
high, several studies indicate that estrogenic compounds may have 
different affinities for ER subtypes [16] and for different organisms 
[17-19]. 

Besides the natural hormone ligands, a large variety of chemical 
compounds (collectively referred to as endocrine disrupting 
chemicals, EDCs) can bind to ERs [20]. Many EDCs have been 
shown to be toxic for animals and humans due to their ability to 
interfere with the normal function of ER, leading to many adverse 
effects such as reproductive problems, hormonal and immune 
system malfunctions, several types of cancer [21-23] and 
feminization in some fish and amphibians [24-31]. 

Thousands of EDCs, belonging to various chemical classes such 
as drugs, pesticides, byproducts of plastic and healthcare 
industries, are commonly present in the environment as a result 
of industrial, agricultural and household waste [32]. EDCs may 
also arise from the degradation pathway of otherwise harmless 
compounds. They are of particular concern due to their wide 
environmental dispersion and to their tendency to bio-accumulate 
[33]. It is not possible to classify a compound as an EDC, and thus 
potentially dangerous, based on its chemical properties alone; 
instead, the ability of such compound to bind to ER and alter its 
function should be investigated. Monitoring the presence of a vast 
number of different EDCs in soil and water and studying their 
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biochemical effect on ER is considered one of the key current 
challenges for ensuring healthy ecosystems in both developed and 
in-development countries. 

For these reasons, analytical methods that exploit ER as a bio- 
recognition element to detect the presence of EDCs are 
particularly attractive: if a chemical binds to ER in the assay, 
then it means that it can potentially interfere with the hormone 
signaling pathways and thus be toxic. 

X-ray structures have shown that the human ER binds ligands 
in a highly hydrophobic pocket that can accommodate EDCs of 
different sizes and chemical properties [34-36]. Due to the high 
sequence identity, it is likely that the general conformation of the 
ER ligand binding site is conserved; however, local structural 
differences and a certain degree of conformational flexibility have 
to be present to account for the different properties and affinities of 
EDC compounds. These differences may, at least in principle, be 
exploited for the rational design of modified receptors capable of 
recognizing classes of EDCs with different affinity and selectivity. 

In this manuscript, we aimed to modulate the binding 
properties of the estrogen receptor protein by rationally modifying 
the amino acid composition of the ligand binding domain. By 
combining sequence alignment and structural analysis of known 
ER-ligand complexes with computational analysis, we were able to 
predict single point variants of the estrogen receptor ligand 
binding domain (ER^ 13 ) with altered binding properties with 
respect to the wild type ER ligand binding domain (wt-ER 0I LBD ). 
These predictions were experimentally confirmed by producing 
and characterizing the most relevant recombinant ER,™ 
variants. In particular, we were able to generate a single point 
ER mutant with a 6-fold increased binding affinity towards some 
EDCs (bisphenolic compounds), reaching i.e. an IC 50 value of 
2 nM for 1 7oc-ethinylestradiol ligand. 1 7a-Ethinylestradiol is an 
orally bio-active hormone and one of the most commonly used 
medications, identified as an emerging aquatic pollutant due to its 
bio-accumulation in surface waters [37,38]. Due to the increased 
affinity of one of our ER variants for this and other compounds, 
utilizing such mutated ER instead of the wt-ERj,™ as bio- 
recognition element in an assay or biosensor would result in 
increased sensitivity. 

Results 

Sequence and structural analysis of Estrogen Receptors 

The full-length Estrogen Receptor a (ER a ) is a protein of 
approximately 65 kDa formed by several independent structural 
domains (Figure 1A). The so-called Ligand Binding Domain 
(LBD, approximately 25 kDa) is necessary and sufficient to bind 
either the natural ligand ( 1 7 fi-estradiol) or EDCs. 

Comparison of the amino acid sequence of several species shows 
that the LBD (corresponding to the polypeptide fragment 303-547) 
is highly conserved. The polypeptide fragment 303-547 of the 
human ER a has been thus cloned (with the addition of a His 6 N- 
terminal sequence for affinity purification) into the expression 
vector pET21a(+) to produce the recombinant protein used in 
these studies: ER a (303-547), or in short ER a LBD . 

X-ray crystallography structures have shown the region of the 
human ER a LBD responsible for EDCs binding [34-36]. EDCs 
bind to a highly hydrophobic pocket that can accommodate 
compounds with different sizes and chemical properties. The 
residues forming the ligand binding pocket are almost universally 
conserved but some intriguing differences are nonetheless present. 

As a first step we defined an ER a LBD residue as belonging to the 
binding pocket if any of its atoms are within 6 A of the ligand in 
any of the x-ray structures of human ER ligand binding domain 



bound to different EDCs of various sizes. The 6 A distance cut-off 
is rather large in order to obtain a conservative description of the 
binding pocket. This identifies a set of 25 non-contiguous amino 
acids (human ER a numbering), shown in Figure IB. 

We then compared the sequence alignment of these residues in 
more than 200 ER proteins from different species. Most residues 
have 100% conservation with a few exceptions noted below 
(Figure IB). Position 349 is predominandy occupied by a 
methionine (M), but it can also be leucine (L) as in the case of 
the human protein or, more rarely, valine (V). All three possible 
variants in position 349 (M, L, V) have aliphatic side chains with 
similar size and properties. Computational simulations suggest that 
these mutations should not have a great impact on the binding 
pocket. 

Residue 421 is usually a methionine (M) but it is replaced by 
isoleucine (I) or leucine (L) in several species, a rather conservative 
mutation once again. In 22 of the analyzed ER sequences, 
however, residue 421 is a phenylalanine (F), which is still 
hydrophobic but has properties not shared by the other amino 
acids. First of all, F421 introduces an aromatic residue in the 
binding pocket and it may increase the affinity of the receptor for 
aromatic compounds such as, for instance, bisphenols. In second 
order, introducing the large phenylalanine side chain may reduce 
the size of the ligand binding pocket. The F421-ER a , thus, 
may be unable to bind large EDCs due to steric clashes with the 
phenylalanine aromatic ring. Alternatively, structural changes in 
the binding pocket may be required to accommodate large EDCs. 

Less frequent and conservative amino acid substitutions are also 
present at position 350, 351, 522, and 525. 

Computational design of mutant ER receptors 

To rationally design ER receptors with altered binding 
properties towards different classes of EDCs, we predicted the 
structural effects of introducing single point mutations in the ER a 
binding pocket. The first candidates for virtual mutations analysis 
were those amino acids that exist as natural variants in different 
species. This strategy should increase the probability that the 
resulting mutated protein is functional and properly folded. To this 
end, the most interesting starting point seemed to be position 42 1 , 
for which there are existing natural variants carrying either M, L, 
I, or F amino acids, which provide opportunities for both 
conservative (M421L, M421I) and non-conservative (M421F) 
mutations. We used computational approaches to predict: the 
structure of each ER mutant of interest (M421F, M421I, M421L); 
the structure of the complexes between each mutant and either the 
natural 1 7 fi-estradiol ligand or some selected EDCs. 

To predict the structure of the unbound, mutated ER a we 
either i) replaced the side chain of the interested residue and used 
rotamer libraries to define its new orientation or ii) predicted the 
whole protein structure by homology modeling. No significant 
structural differences were found between wt and mutated ER 
when unbound. This is not surprising, since even the largest side- 
chain (F421) can be accommodated in the existing binding pocket 
if this is not occupied by ligands. 

We then moved to predict the three-dimensional structure of 
the complex between wild type and mutated ER and selected 
EDC ligands. As a first test to assess the accuracy of our docking 
approach, we predicted the structures of 17 fi-estradiol and 
bisphenol-A with human wt-ER (Figure 2A). X-ray structures of 
each of these are available and the computational results were in 
very good agreement with the experimental data, supporting the 
idea that computational predictions can be used to assess the 
impact of the mutations. We then proceeded to predict the 
structure of the above mentioned compounds in complex with the 
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Figure 1. Schematic representation of the structural domains of ER protein. A) A cartoon representation of the three-dimensional structure 
of the ligand binding domain is shown, as well as its sequence. Residues belonging to the ligand binding pocket are shown in red in both the 
structure and sequence. Residues highlighted in yellow belong to the histidine tag, residues in light blue encompass the ligand binding domain. B) 
Degree of conservation for residues of the ligand binding pocket among the analyzed ER sequences. Full bars correspond to 100% conservation. 
doi:1 0.1 371 /journal.pone.01 02658.g001 



ER mutants. We used two different docking algorithms with either 
rigid or flexible docking options. In the latter case the side-chains 
and backbone of the protein are allowed to move, trying to 
accommodate local structural rearrangements upon EDC binding. 

It should be pointed out, however, that large changes in the ER 
structure cannot be correctly simulated and would not be detected 
by this approach. No significant differences amongst the various 
procedures were noted. 

Ligand binding to M421 F-ER a LBD 

We suspected that introduction of an aromatic ring in the 
binding pocket (M42 1 F mutant) could increase the affinity of ER 
for aromatic compounds. Indeed, computational docking predic- 
tions suggest that bisphenol-A bound to M421F-ER ot LBD changes 
in comparison to its complex with wt-ER a LBD , bringing one of its 
aromatic rings closer to the ring of F42 1 (Figure 2B), something 
that cannot be achieved with wt-ER,,, 1 ^ 0 (Figure 2A) This may 
result in a higher affinity of bisphenol-A for M421F-ER a LBD , 
either due to the formation of direct aromatic stacking interactions 
or as a consequence of the increased aromatic character of the 
mutated binding pocket. 

Phenolic compounds linked to bulkier aliphatic side chains (e.g. 
4-nonylphenol) are somehow different: although they can fit in the 
mutated ligand binding pocket just like wt-ER a LBD , their larger 



size does not allow their aromatic ring to get closer to M42 1 F as it 
happens to bisphenol-A (Figure 2C). We wouldn't expect, 
therefore, an increased affinity of M42 1 F-ERo, LBD for these 
compounds. 

The natural ER ligand, 1 7 (5-Estradiol, has an aromatic ring that 
may be favorably affected by the M42 1 F substitution in a similar 
way to bisphenols (Figure 2D). However, the aromatic ring is far 
away from M421 in the wt-ER a LBD experimental structure; if 
F421 would maintain the same conformation, it would be unable 
to achieve direct aromatic stacking interactions with 1 7 P-Estradiol. 
On the other hand, if 1 7 (3-estradiol would bind to F421 in a 
position different from the one it has in wt-ER QI LBD , then the 
M42 1 F-ERj™ could have increased affinity for 17|3-estradiol 
due to favorable interactions between the aromatic rings. 
Computational docking of 1 7 (5-estradiol in complex with 
M42 1 F-ERc, 1 ^ 13 shows a binding conformation comparable to 
that of wt-ERa 1 " 813 in the models considered more energetically 
favorable by the algorithm (Figure 2D). Models deemed to be less 
stable, however, position the aromatic ring of 17|3-estradiol in 
proximity of the ring of F421, which may result in increased 
binding affinity. Since computational algorithms are notoriously 
unreliable at correctly identifying energetically favored ligand 
positions within the same binding region, it can be suggested that 
1 7 P-estradiol can fit in the mutated binding pocket with its 
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Figure 2. Computational docking models of ligands in the ER binding pocket. The protein backbone is shown as grey cartoon; ligands and 
side chain of residue 421 are shown as sticks; hydrogen atoms are not shown for clarity. M421 F-ER a LBD mutant bound to bisphenol-A (orange in B), 4- 
nonylphenol (green in C) and 17(3-estradiol (red in D). In contrast to the wt-ER a LBD (shown in A with bisphenol A), the aromatic ring of the F421 
mutated ER may increase the affinity for the ligands either by direct stacking interactions or by increasing the aromatic content of the ligand binding 
pocket; no steric clashes are created by the incorporation of this larger side chain. 
doi:10.1371/journal.pone.0102658.g002 



aromatic ring close to F421. Whether this conformation is 
preferred in vivo remains to be seen. 

Finally, computational docking of tamoxifen (a commercial 
drug that binds to ER) shows no appreciable differences between 
wt-ERo, LBD and M421F-ER OI LBD . Although tamoxifen is much 
larger than the natural 1 7 fi-estradiol ligand, x-rays structure has 
shown it to only partially occupy the ligand binding pocket [34], 
which can thus probably be restricted without affecting the ligand. 

Production and characterization of ER mutants 

Structural and computational analysis of the wt-ER a LBD 
binding pocket suggested that mutations in position 421 could 
have a significant influence on the affinities of different EDCs 
towards the ER. In order to verify this hypothesis by experimental 



measures of binding affinity we generated three ER mutants: 
M421F-ER C! LBD , M421I-ER a LBD , and M421L-ER OI LBD . 

High yield of proteins, ranging from 25 to 30 mg/L of liquid 
culture, were obtained. Following a single purification step by 
affinity chromatography pure proteins samples were obtained, as 
shown in the SDS-PAGE of the various ER mutants (inset in 
Figures 3A, B and C). 

CD spectroscopy confirms that the mutants are properly folded 
(Figures 3B and 3C) and comparable to wt-ER a LBD (Figure 3A). 
The percentage of secondary structure elements were calculated 
using standard algorhythms provided by the online DichroWeb 
service [39]. The calculations showed the presence of 54% ot- 
helical secondary structure for all the wild-type and the mutants 
which reflects well the secondary structure elements derived from 
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Figure 3. Circular dichroism and SDS-PAGE of recombinant ER proteins. CD spectra show typical -/-helical character and a single band at the 
expected molecular weight is visible (insets). (A) Far-UV CD and SDS-PAGE of wt-ER a LBD (B) Far-UV CD and SDS-PAGE of M421 F-ER a LBD (C) Far-UV CD 
and SDS-PAGE of M421 l-ER a LBD (D) - (F): thermal unfolding data of the three proteins. Intensity of peak at 222 nm plotted as a function of increased 
temperature. Melting temperature (T m ) reported. 
doi:1 0.1 371 /journal.pone.01 02658.g003 



the available three dimensional structure of wt-ER a (PDB: 
1ERE). 

The thermal stability of the different ERs was assessed by 
following the protein thermal unfolding between 25 and 90°C with 
CD spectroscopy (Figures 3D, 3E and 3F). The unfolding process 
is not reversible: by cooling back the samples to 25°C only around 
50% of the original secondary structure of the protein is recovered 
(data not shown). By fitting the experimental data with a 
Boltzmann-type equation it was possible to calculate melting 
temperature (T m ) values of 59.5±1.5°C, 65.8±1.4°C, and 
61.5±1.3°C for the wt-ER a LBD , M42 1 F-ER a LBD , and M421I- 
ERo, 1 ^ 0 proteins, respectively. 

The M421L-ER„ mutant had the same characteristics of 
purity, secondary structure and melting temperature of M42 1 1- 
ERo, LBD (data not shown). 

The melting temperature of the wt-ER a LBD receptor is about 
20°C higher than that previously reported for the full-length ER a 
protein [40,41]. The isolated ligand binding domain, in other 
words, appears more stable than the full length ER protein. It is 
also interesting to note that it is reported that the CD signal 
decreases sharply as the temperature increases from 20 to 30°C 
and then increases gradually from 30 to 70°C. We, instead, 
observe a single transition sigmoidal trend. Our results also 
indicate that the M42 1 F-ER a LBD mutant is significantly more 
stable than the wt-ER^ 0 . 



Competitive binding assay 

The PolarScreen assay was used to test the ligand binding 
affinity of wt-ER QI LBD and its mutants towards different classes of 
estrogen disrupting compounds. The tested chemical compounds 
(shown in Figure 4) were: the natural ligand 1 7 [(-estradiol; its close 
analog 1 7ot-ethynilestradiol (chosen since it is an emerging 
pollutant [42]); tamoxifen, a commercial drug antagonist often 
detected in surface waters [43]; and three other compounds with 
known estrogen disrupting activity and significant presence in the 
environment, bisphenol-A, 4-nonylphenol and 4-tert-octylphenol 
[44]. The latter three have micromolar binding affinity whereas 
the first three have nanomolar affinity for wild type ER [45] . 

As a first test we compared the binding affinity of our wt- 
ER 0( LBD protein with that of the full length ER supplied with the 
kit. The binding affinities for the 1 7 fi-Estradiol ligand were 
measured as 1 2 ± 2 nM and 1 6 ± 4 nM for the full length and wt- 
ERo, LBD respectively, indicating that the isolated ligand binding 
domain is just as active as the full protein in terms of ligand 
binding. 

After this control, we measured the binding affinity of the above 
mentioned compounds for the recombinant receptors (wt and 
mutants; results are shown in Figure 5 and Table 1). 

Each compound was tested in a range of 1 2 concentrations with 
the ER receptors (Figure 5). The compounds concentrations (X 
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Table 1. IC 50 values and standard errors resulting from the competitive binding assay performed in four replicates with wt-ER a 
(column 2), M421F- ER a LBD (column 3) and M421I- ER a LBD (column 4) and selected compounds (ligands, column 1): 17|3-estradiol, 
17a-ethinylestradiol, bisphenol-A, tamoxifen, 4-nonylphenol and 4-tert-octylphenol. 





"C 50 


LIGANDS 


wt-ER„ LBD 


M421F-ER„ LBD 


M421 l-ER„ LBD 


17p-Estradiol 


16±4 nM 


7±2 nM 


191 ±50 nM 


1 7a-Ethinylestradiol 


12±6 nM 


2.1 ±0.9 nM 


90±21 nM 


Bisphenol-A 


11 ±2.5 |xM 


1.9±0.5 |xM 


100±9 ]xM 


Tamoxifen 


47±14 nM 


55±40 nM 


830±300 nM 


4-Nonylphenol 


8±1 |iM 


4±2 |lM 


60 ±30 ]xM 


4-Tert-octylphenol 


11.4±1.5 fiM 


10±4 \lM 


200±60 flM 



The M421 F-ER a LBD shows lower IC 50 , compared to wt-ER a LBD , for four compounds (1 7p-estradiol, 17a-ethinylestradiol, bisphenol-A and 4-nonylphenol) out of six. 
The M421l-ER a LBD has higher IC 50 , compared to wt-ER^ 80 , for all the tested compounds. 
doi:1 0.1 371 /journal.pone.01 02658.t001 



axis) were plotted in a logarithmic scale against the Polarization 
values (Y axis). The plot of compounds concentrations against 
Polarization were fitted with a one site competition four 
parameters logistic curve (see Materials and Methods for further 
details). 

The half maximal inhibitory concentration (IC 50 ) values of the 
wt-ER,,™ towards the selected compounds agreed with literature 
data [18,46]. Differences in protein preparation and assay 
conditions may contribute to the small variability seen among 
literature and present data. 

The IC 30 of the mutant M421F-ER a LBD (red circles in Figure 5) 
with most compounds was different from the wt-ERa 030 (blue 
squares in Figure 5). In particular, IC 50 values indicate that 17a- 
ethinylestradiol and bisphenol-A bind approximately 6 times more 
strongly to M42 1 F-ER 0! LBD than to wt-ER a LBD (Figure 5 and 
Table 1). 1 7 P-Estradiol and 4-nonylphenol bind twice as strong to 
M421F-ER„ , as well. No differences were observed for 
tamoxifen and 4-tert-octylphenol whose IC50 values are not 
significandy different between M421F-ER a LBD and wt-ERo," 313 
(Figure 5 panels D and F). 

Comparing the IC50 values of the mutant M421I-ER a 
(green triangles in Figure 5) with the wt-ER a LBD (blue squares in 
Figure 5) we observe, instead, that in all cases the mutation 
negatively affects the binding affinity, with IC50 values between 5 
and 20 times higher. The M42 1 L-ER a LBD gave the same results as 
M42 1 1-ERo! 1 " 513 and for this reason was not further considered. 
Repeated measurements were performed over a 9 months period 
with proteins stored at — 20°C; the low standard errors suggest 
that the proteins are stable, retaining not only their tertiary 
structure but also their binding ability for at least 9 months. This is 
an encouraging observation about the possibility to use these 
receptors as bio-recognition elements in an assay to detect EDCs. 

Discussion 

The possibility of rationally modify ER receptors with improved 
affinities towards different classes of endocrine disrupting chem- 
icals would allow both the development of analytical tools for the 
rapid screening and detection of EDCs and a better understanding 
of the their structure-activity relationship. To this end the 
combination of structural analysis and computational modeling 
allowed the design of mutants predicted to have different affinities 
towards different EDC classes. Indeed, we obtained mutated ERs 
with 2 to 6 fold increased binding affinity for 17(3-estradiol, 17a- 
ethinylestradiol, bis-phenol-A and 4-nonylphenol. 



Computational docking simulations suggested that the mutant 
M42 1 F-ERj™ would have improved binding affinity for at least 
some chemical compounds, as it was later verified by experimental 
results. Generally speaking, docking results cannot be taken at face 
value, especially when comparing binding energies. However, they 
do provide useful hints about what is feasible and what not, thus 
guiding the rational design of mutants. In the case of M421F- 
ER 0( LBD , simulations indicated that introduction of an aromatic 
side chain could favor binding of aromatic compounds, either 
through direct stacking interactions between the aromatic rings of 
chemicals and protein side chains or simply by increasing the 
overall aromatic content of the ligand binding pocket. It is also 
possible that the mutation stabilizes transient intermediates that 
facilitate passage of the organic compound from the media to the 
buried ligand binding pocket, although we have no direct evidence 
for this. 

In contrast to the M421F mutation, the M421I restricts the 
binding pocket (Isoleucine has a bulkier side chain than 
Methionine) without increasing the aromatic content. This reduces 
the binding affinities of all the tested compounds by 8 to 20 folds. 

The in-silico designed mutants, M421F-ER a LBD and M421I- 
ERo, LBD , were properly folded and active in solution, as shown by 
their CD spectra and binding properties. These findings highlight 
the advantages of sequence analysis and the use of mutations that 
are rare but yet present as natural variants in some species. This 
approach highly increases the possibility that the resulting mutants 
are properly folded and active, something that is not always 
guaranteed when mutants are generated through randomization 
technologies or other strategies. As an added bonus, the isolated 
ligand binding domain that we used proved to be more stable than 
the full-length ER protein used in other published or commercial 
assays. Introduction of the rational mutation M421F further 
increased protein stability, both in terms of increased resistance to 
thermal denaturation (as proved by the increase of the melting 
temperature from 60 to 66°C) and of ligand binding activity after 
prolonged storage. 

The M421F-ER a with increased stability and higher binding 
affinities towards selected EDCs could be used as the capture 
agent in affinity chromatography. For example, our recombinant 
ERo, LBD receptor was recently used for the fast detection of 
estrogens in dietary supplements [47]. In addition, the availability 
of ER proteins with variable binding affinities (i.e. wild type, 
M42 1 F and M42 1 1 mutants) could be used in array format assays 
to not only detect EDCs but also to discriminate amongst different 
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EDC classes. Moreover, this kind of receptors can be used as bio- 
recognition element for label-free detection by means of highly 
sensible techniques, such as those based on SPR (Surface Plasmon 
Resonance), QCM (Quartz Crystal Microbalance) or MC (Micro 
Cantilever) [48]. Such detection systems could be then applied for 
the EDC screening in complex matrices such as food, aquaculture, 
fresh and seawater as well as for screening of chemicals with 
potential EDC activity. 

Our results indicate, as a proof of concept, that the combination 
of structural and sequence analysis with computational simula- 
tions, allow the successful rational design of ER mutants with 
desired binding properties. We think that the workflow illustrated 
in this manuscript could be successfully applied to the rational 
design of other ER mutants or to the modification of other ligand 
binding proteins. 

Materials and Methods 

Materials 

The wild-type estrogen receptor ot ligand binding domain 
encoding gene was synthesized and cloned into pET21a(+) by 
Genscript (Piscataway, NJ, USA). The QuikChange Site-Directed- 
Mutagenesis Kit was purchased by Qiagen (Hilden, Germany), 
while primers and sequencing were done by Primm (Milan, Italy). 

The Bacto Yeast Extract and Bacto Tryptone for the Luria 
Bertani (LB) medium were from BD Biosciences (Franklin Lakes, 
NJ, USA). Rosetta 2(DE3)pLysS Competent Cells come from 
Merck. 

Isopropyl p-D- 1 -thiogalactopyranoside (IPTG), Tris-HCl, 
NaCl, |3-mercaptoethanol (P-Me), 3-( 1 -pyridinio)- 1 -propanesulfo- 
nate (NDSB) and the test compounds 17|3-estradiol, 17a- 
ethynilestradiol, bisphenol-A, tamoxifen, 4-nonylphenol and 4- 
tert-octylphenol were supplied by Sigma Aldrich. 

Nickel HiLoad Column and the AKTA System purifier are 
from GE Healthcare. Life Technologies supplied the PolarScreen 
Estrogen Receptor-oc Competitor Green Assay. 

Alignment and Computational analysis 

Estrogen receptor protein sequences from several species were 
retrieved through the Expasy protein server database (http:// 
www.expasy.org/). Sequence alignment was performed according 
to standard procedures using the Jalview program suite and 
included algorithms (http://www.jalview.org/). Duplicate and 
incomplete sequences were manually removed from the analysis. 

Structural predictions of mutated ER. 

The X-ray structure of a human estrogen receptor complexed 
with bisphenol-A (PDB: 3UU7) was used as a starting PDB file. 
Mutants model structures were predicted by Pymol [49] using the 
wild type as a template structure and changing the side chain 
conformation according to rotamer libraries. Homology modeling 
was performed with the i-Tasser web server [50,51]. The PDB files 
of wild type and mutant model structures were prepared for 
docking using the dockprep tool in UCSF Chimera package [52], 
while bisphenol-A was prepared using MarvinSketch (Marvin 
5.9.0, http://www.chemaxon.com/) and OpenBabel [53]. Com- 
putational docking was performed by means of SwissDock server 
(www.swissdock.ch) based on the docking software EADock DSS 
[54], with a user-defined box (15Ax 15 Ax 15 A) centered on the 
receptor binding site of the LDB. Briefly, a tree-based Dihedral 
Space Sampling (DSS) algorithm generates 15000 binding modes 
that are subsequendy minimized (100 steps of steepest descent 
algorithm and 250 steps of adopted basis Newton Raphson 
algorithm). Redundant binding modes and binding modes making 



littie or detrimental interactions with the protein are removed. 
Simultaneously, the CHARMM [55] energies of the remaining 
binding modes are estimated on a grid. Then, binding modes with 
the most favorable energies are ranked, taking account of the 
solvent effect using the FACTS implicit solvation model [56], and 
clustered by root mean square deviation (RMSD) with a distance 
cutoff of 2 A. Finally, the most favorable clusters are ranked by an 
estimated AG of binding. Selected calculations were repeated with 
the docking program Autodock 4.2 [57] with default options and 
similar search boundaries as above. Docking was performed either 
with rigid or flexible body. The latter option allows for limited 
movement of protein side chain and backbone. No significant 
differences were noted between the various algorithms. Subse- 
quent molecular graphics and analyses of docking results were 
performed with the UCSF Chimera package [52], or with Pymol 
[49]. 3D structures were analyzed with the program Pymol. 

In an attempt to cover the different conformations allowed to 
the F42 1 mutant, we generated four different models in which the 
aromatic side-chain occupies four different positions, defined by 
either rotamer libraries (using Pymol) or de novo structural 
predictions (using the iTASSER server). We then used each of 
these models independently in subsequent docking simulations. As 
a validation step for our approach we docked EDC/ER complexes 
with known experimental structure. Evaluation of the spatial 
distance of the ligand position between experimental and 
predicted structure (RMSD) showed that solutions nearly identical 
to the experimental ones were selected by the computational 
algorithm as most energetically favorable. This suggests that the 
approach can give reliable information also for complexes with 
unknown experimental structure. 

Generation of mutated plasmids 

In order to selectively mutate the methionine 421 of the wt- 
ER a LBD into phenilalanine, leucine or isoleucine, the plasmid 
containing the wild type gene was used as starting material. 

To generate the M421F, M421I and M421L variants two 
mutagenic primers (forward and reverse) were designed for each 
single mutation according to indications on the QuikChange Site- 
Directed-Mutagenesis Kit (Qiagen) with a length between 25 and 
45 base pairs, T m >78°C. 

Forward Primer M421F: 5'- AGGCAAATGCGTC- 
GAGGGTTTTGTGGAAATTTTTGACATGC-3 ' 

Reverse Primer M421F: 5'- GC ATGTC AAAAATTTC C A- 
CAAAACCCTCGACGCATTTGCCT -3' 

Forward Primer M421I: 5 ' -GC AAATGC GTC GAGGG- 
TATTGTGGAAATTTTTGACATG-3 ' 

Reverse Primer M421I: 5'-CATGTCAAAAATTTCCACAA- 
TACCCTCGACGCATTTGC-3' 

Forward Primer M421L: 5 ' -ATC AAGGC AAATGCGTC- 
GAGGGTCTGGTGGAAATTT-3 ' 

Reverse Primer M421L: 5'-AAATTTCCACCAGACCCTC- 
GACGCATTTGCCTTGAT-3' 

Each reaction volume contained the reaction buffer, plasmid 
pET2 1 a(4-)-wt-ER 0( LBD , the two primers, dNTP mix, and water. 
The PFU-Turbo polymerase (QuikChange Site-Directed-Muta- 
genesis Kit, Qiagen) was added and the PCR reaction was 
performed in a PCR Thermocycler (Stratagene). After the PCR, 
the Dpnl enzyme (Qiagen) was added to digest the parent DNA 
and then the mixture was used to transform XL 1 Blue competents 
cells (Qiagen). The extracted plasmids from the positive colonies 
were sequenced (Primm). 
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Protein expression and purification 

The pET21a(+) plasmid containing the ER gene was used to 
transform Rosetta 2(DE3)pLysS Competent Cells (Merck), plated 
on ampicillin selective LB-agar and incubated over night at 37°C. 
Different E.coli strains, such as One Shot BL21 Star (DE3), 
Rosetta 2(DE3) and Rosetta 2(DE3) pLysS were tested and the 
latter ones were chosen because they showed the best yield in 
terms of purity and amount (mg/1 culture) of the expressed protein 
due to the pLysS plasmid that expresses T7 lysozyme [58] . 

The cells are then grown in liquid culture (LB with 10% sucrose) 
at 37°C until an OD 600 of 0.7 is reached, then IPTG 0.5 mM and 
3% ethanol are added. The protocol was modified starting from 
the one of Ahrens et al. [59]. Afterwards cells are grown at 18°C 
for 18 hours; washed once in buffer A (50 mM Tris-HCl, 50 mM 
NaCl, 10 mM P-mercaptoethanol, 10 mM ((3-( 1 -pyridinio)- 1 - 
propanesulfonate), 1 tablet of complete protease inhibitor, and 
20 mM imidazole), then sonicated and centrifuged at 18000 rpm 
for 1 h at 4°C. The supernatant is then loaded on a 5 mL Nickel 
HiLoad Column (GE Healthcare) connected to an AKTA system 
purifier while the pellet was resuspended in Tris-HCl 50 mM 
pH 7.5 and Urea 6 M (5 mL per liter of culture) to extract 
additional proteins from the cell debris and left reacting for 5 
minutes, then centrifuged at 18000 rpm for 1 h at 4°C. 

The Nickel HiLoad column is equilibrated with washing buffer 
IX (50 mM Tris-HCl pH 7.5, 500 mM NaCl, 20 mM imidazole, 
10 mM P-mercaptoethanol) prior to supernatant loading. After 
washing out the unbound proteins with 4 column volumes 
(20 mL), the elution of the His-tagged protein is performed by a 
linear gradient from 0 to 100% of elution buffer (50 mM Tris-HCl 
pH 7.5, 300 mM NaCl, 500 mM imidazole, 10 mM P-mercap- 
toethanol). The fractions that absorb at 280 nm are run on a SDS- 
PAGE gel to visualize the presence of the protein. 

The purified fractions containing the protein are dialyzed in 
storage buffer (Tris-HCl 20 mM pH 7.4 and NaCl 150 mM) to 
remove the imidazole and to lower the salt concentration. The 
purified protein was then stored at — 20°C. 

Protein characterization with Circular Dichroism (CD) 
spectroscopy 

Circular dichroism spectra were recorded with a J750 Circular 
dichroism spectrometer from Jasco interfaced with a Peltier 
temperature control unit with 1 mm path length cuvette. Analysis 
of the far-UV region (185-260 nm) was used to investigate the 
secondary structure and the folding of the protein. CD spectra 
were analyzed with the DichroWeb [39] online software, by means 
of the CDSSTR program [60]. Thermal denaturation experi- 
ments were performed by monitoring the circular dichroism at 
222 nm while changing the temperature from 25 to 90°C (1°C/ 
min) and backward. Reference spectra were collected for the 
buffers in which proteins were dissolved. Denaturation curves were 
analyzed assuming a two-state unfolding model and melting 
temperature (T m ) was calculated for every mutant [61]. All the 
statistical analysis and fittings were performed with Sigma Plot 
12.3. 
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Competitive binding assay 

To test the binding affinity of the recombinant receptor for 
EDCs we used the competitive binding assay developed by Life 
Technologies for ER^, the PolarScreen Estrogen Receptor-oc 
Competitor Green Assay. Although radioactive binding assays are 
widely used for ER, fluorescence assays are just as reliable but do 
not require radioactive safety procedures making them more 
convenient. 

The full length ER a provided with the kit was replaced by our 
recombinant ER a . In short, wt or mutant ER a are added to 
the Fluormone and incubated at 4°C for 45 minutes to form the 
receptor-Fluormone complex. This complex is then mixed and 
incubated at 25°C for 2 h with the individual EDCs and the 
intensity of the fluorescence polarization signal (directly propor- 
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P=(F\\-F±)/(F\\+F±) 

Where: 

Fll = Fluorescence intensity parallel to the excitation plane, 
F j_ = Fluorescence intensity perpendicular to the excitation 
plane. 

The P data (y) were fitted against the competitor concentrations 
(x) with a typical one site competition three parameters logistic 
curve: 

y= min + [(max - min)/(l + 10 x_io?/c 50)] 

Where: 

min and max are respectively the minimum and maximum 
polarization values, LoglCso = Log of the concentration of test 
compound required to reduce the maximum polarization value by 
50%. All the resulting IC 50 values are obtained by the average of 
at least four different experiments. 
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